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ABSTRACT 

We analyse the bivariate distribution, in color versus absolute magnitude (u — r vs. M r ), of a low 
rcdshift sample of galaxies from the Sloan Digital Sky Survey (SDSS; 2400 deg 2 , 0.004 < z < 0.08, 
—23.5 < M r < —15.5). We trace the bimodality of the distribution from luminous to faint galaxies by 
fitting double-Gaussians to the color functions separated in absolute magnitude bins. Color- magnitude 
(CM) relations are obtained for red and blue distributions (early- and late-type, predominantly field, 
galaxies) without using any cut in morphology. Instead, the analysis is based on the assumption of 
normal Gaussian distributions in color. We find that the CM relations are well fit by a straight line 
plus a tanh function. Both relations can be described by a shallow CM trend (slopes of about —0.04, 
—0.05) plus a steeper transition in the average galaxy properties over about two magnitudes. The 
midpoints of the transitions (M r — —19.8 and —20.8 for the red and blue distributions, respectively) 
occur around 2 x 10 10 A4q after converting luminosities to stellar mass. Separate luminosity functions 
are obtained for the two distributions. The red distribution has a more luminous characteristic 
magnitude and a shallower faint-end slope (M* = —21.5, a = —0.8) compared to the blue distribution 
(a —1.3 depending on the parameterization). These are approximately converted to galaxy stellar 
mass functions. The red distribution galaxies have a higher number density per magnitude for masses 
greater than about 3 x 10 10 M.q- Using a simple merger model, we show that the differences between 
the two functions are consistent with the red distribution being formed from major galaxy mergers. 
Subject headings: galaxies: evolution — galaxies: fundamental parameters — galaxies: luminosity 
function, mass function. 



1. INTRODUCTION 

Optical color-magnitude (CM) 1 diagrams have been 
used as scientific diagnostics in astronomy since the pi- 
oneering work of E. Hertzsprung and H. N. Russell (c. 
1910). While the CM relations for stars are now well 
established in terms of stellar evolution theory, the case 
for galaxies is less clear. The optical spectra of galax- 
ies are dominated by the integrated light from stellar 
populations, and therefore, the existence of any CM se- 
quence is related to a correlation of galaxy luminosity 
with star- formation history (SFH), stellar initial mass 
function (IMF), chemical evolution and/or dust atten- 
uation. In order to separate CM relations from color- 
morphology relations, most of the study of CM relations 
has concerned galaxies of a similar morphological type. 
The princip al rel ationship between co lor and morphology 
l|Holrnberdll958t iRoberts fc Havneslll994|) is that more 
spheroidal-like galaxies (early types) are generally red- 
der than more disk-like or irregular galaxies (late types). 

A color-magnitude rel ation fo r spher oidal-like systems 
was first established by IBaumI l)1959j) . The integrated 
colors become systematically redder going from globu- 
lar star clusters, through dwarf elliptical galaxies, to 
giant ellipticals. Later, more precise measurements of 
luminous E+S0 galaxies in clusters showe d a shallow 
CM relation with a small intr insic scatter l)FaberlH973t 
iVisvanathan fe Sandagdll977j) . This relation was asso- 

1 Note that in this paper, by 'color-magnitude', we always mean 
'color versus absolute magnitude'. 



ciated with a met allicity-lu minosity correlation (iFabei 
119731: I Larsonll974|) . However. lWorthev. Traeer. fc Fabei 
( 1995) showed that a age-luminosity correlation also fit 
the spectr oscopic data because of the age-metallicity de- 
generacy. iKodama fc Arimotol l)1997j) ruled out the cor- 
relation with age being the primary effect because the 
predicted evolution of the CM sequence with rcdshift 
was more than observed in this case. Thus, the CM 
relation for bright E+S0 galaxies has been established as 
a metallicity-luminosity correlation. The intrinsic scat- 
ter and slope of this 'E+S0 ridge' can be used to place 
constraints on th e star-formation and merging histo ries 
of these galaxies dBower. Kodama. fc Terlevicbl lT998') . 

Color-magnit ude relations for early-type spirals were 
established 
for spirals 

IVisvanathanl l)1981|) : iTullv. Mould, fc Aaronsonl (|1982j). 
These CM relations are more complicated than for the 
E+S0 ridge f or a number of rea sons. The intrinsic is scat- 
ter is larger ijGriers mith 198(3) an d t he luminosity corre- 
lation s can be associate d with SFH llPeletier fc de Grii 4 
1998), dust attenuation (ITullv et aUl998 |) a nd/or metal- 
licity ijZaritskv. Kennicutt. fc Huchralll994j) . It is prob- 
able that for some morphological types and across some 
ranges of absolute magnitude, all three effects are signif- 
icant. 

When all morphological types are considered together, 
the color distribution of galaxies can be approximated 
by the sum of two 'normal' Gaussian functions, a bi- 
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modal function ijStrateva et al.ll200l|) . The bimodality of 
the galaxy population has been known qualitatively for 
some time. Researchers general consider E+SO galax- 
ies to be early typ es and Sa-Sd spira ls and irregulars to 
be late types, and iTullv et alJ l)1982f) noted that "early 
and late morphological types occupy separate branches 
in the color-magnitude diagram" . With the advent of 
large spectroscopic redshift surveys, it is now possible to 
precisely analyse this color bimodality as a function of 
absolute magnitude, for the field population in particular 
(whereas previously clusters offered the best opportunity 
to study CM relations since all the cluster members are 
approximately at the same distance). 

A natural explanation for the bimodality is that the 
two normal distributions represent different populations 
of galaxies that are produced by two different sets of 
processes. In other words, formation processes give rise 
to two dominant populations that have different aver- 
age colors and/or color dispersions. Evidence that the 
color bimodali ty is due to this co mes from the clustering 
analysis of Budavar i et alJ ((2003) . When the galaxy pop- 
ulation was divided into four color bins, the two reddest 
bins showed a similar clustering strength to each other, 
as did the two bluest bins, with a sharp transition in 
properties between them. This can be explained if the 
dominant effect is the fraction of galaxies that are part 
of the red or blue normal distributions, rather than the 
average color of the galaxies. Galaxies that are part of 
t he red distr i bution are more strongly clustered. 

IBell et all l)2003bl) used only colors to define a red se- 
quence from a photometric rcdshift survey. The bimodal- 
ity was observed out to a redshift of unity and the evo- 
lution of the red sequence was quantified. In particular, 
they noted a build up of stellar mass on the red sequence 
by a factor of about two since z = 1. This is inconsistent 
with a scenario where red early-type galaxies form early 
in the Universe and evolve passively to the present day, 
and it favors scenarios where the red sequence derives 
from merger processes. 

For our color analysis, we use data from the Sloan Dig- 
ital Sky Survey (SDSS). The SDSS is unique for studying 
the CM distribution of low-rcdshift galaxies because the 
survey has obtained over 10 5 redshifts for z < 0.1 galax- 
ies with associated five-color photometry. An overview of 
various biv ariate distribut i ons, in cluding CM relations, 
is given bv iBlanton et al. (2003c). Here, we focus on 
one particular color and analyse in more detail the low- 
redshift distribution of galaxies (z < 0.08; u — r vs. M r ). 
We also extract luminosity functions for the red and blue 
distributions (early- and late- type galaxies), relate our 
results to stellar mass and consider a merger explanation 
for the bimodality. The plan of the paper is as follows; in 
Section |2 we describe the SDSS data and sample selec- 
tion; in Sectional we show the CM bivariate distribution; 
in Section^] we describe our assumptions, aims and the 
parametric analysis of the distribution, and; in Sections[5] 
and El we present our results and conclusions. A simple 
merger model is described in the Appendix. 

2. THE SDSS DATA AND SAMPLE SELECTION 

The Sloan Digital Sky Survey i|York et al I 1200?!: 
iStoughton et al.ll200 2:) is a project, with a dedicated 2.5- 
m telescope, designed to image 10 4 deg 2 and obtain spec- 
tra of 10 6 objects. The imaging covers five broadbands, 



ugriz with effective wavelengths of 355, 4 67, 616, 747 
and 892 nm, using a mosaic CCD camera ijGunn et alJ 
Il998|) . Observations w ith a 0.5-m photometric tele- 
scope dHogg et al.H200lD are used to calibrate the 2.5-m 
teles cope images using the u'q'r'i'z' standar d star sys- 
tem (|Fukugita et all 119961: iSmith et all 120021 Spectra 
are obtained using a 640-fiber fed spectrograph with a 
wavelength ran ge of 380 to 9 20 nm and a resolution of 
A/AA ~ 1800 ifUomoto et alJ 119991) . In this paper, we 
analyse a sample of g alaxies selected from the SDSS main 
galaxy sample (MGS: IStrauss et al.l20 02) that selects ob- 
jects for spectroscopic followup to a limiting magnitude 
in the r-band. 

The imaging data are astrometrically calibrated 
(jPier et alJ 12003(1 and the images are reduced using a 
pipeline photo that measures the observing conditions, 
and detects and measures objects. In particular, photo 
produces various types of magnitude measurement in- 
cluding: (i) 'Petrosian', the summed flux in an aperture 
that depends on the surface-brightness profile of the ob- 
ject, a modified version of the flux quantity defined by 
iPetrosianl l)1976|) : (ii) 'model', a fit to the flux using the 
best fit of a de-Vaucouleurs and an exponential profile; 
(iii) 'PSF', a fit using the local point-spread function. 
The ma gnitudes are extinction-corrected usi ng the dust 
maps of lSchlegel. F inkbeiner. & Davis ( 1998). De t ails o f 
the imaging pipelines are g iven bv iLuoton et all {2001) 
and lStoughton et aT] ((2002). 

Once a sufficiently large area of sky has been imaged, 
the data are analysed using 'targeting' software routines 
that determine the objects to be observed spectroscopi- 
cally. The MGS has the following basic criteria: 

rPctro<nimit (1) 
Mr, 50 < Mr, 50, limit (2) 
?"PSF — Tmodcl > Shmit ■ (3) 

The first equation sets the magnitude limit of the sur- 
vey. The second equation sets the surface-brightness 
limit (Mr,50 is the mean surface brightness within the 
Petrosian half-light radius). This is necessary to avoid 
targeting too many objects that are instrumental arti- 
facts. The third equation is used for star-galaxy sepa- 
ration. The limits have been modified since the begin- 
ning of the survey but over most of the survey, they are 
given by ri imit = 17.77, Mr,50, limit = 24.5 mag arcsec" 2 
and sumit = 0.3. The targets from all the samples (oth- 
ers include luminous red galaxies and quasars) are then 
assigned t o plates, each with 64 fibers, using a tiling 
algorithm (IBlanton et al.ll2003aD. Detail s of the MGS 
selection are given bv lStrauss et al.l (|2002f ). 

Spectra are taken using, typically, three 15-minute ex- 
posures in moderate conditions (the best conditions are 
used for imaging). The signal-to-noise ratio (S/N) is typ- 
ically 10 per pixel (pixels width ss 1-2A) for galaxies in 
the MGS. The pipeline spec2d extracts, and flux and 
wavelength calibrates, the spectra. The spectra are then 
analysed by another pipeline that classifies and deter- 
mines the redshift of the object. 

2.1. Subsample Selection from the Main Galaxy Sample 

We use a well-defined subsample of the MGS called 
'NYU LSS samplel2' covering 2400 deg 2 . We set limits 
on the magnitude as follows: 13.5 < r < 17.5 over 30% 
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and 13.5 < r < 17.77 over 70% of the area. The 17.5- 
limit corresponds to earlier targeting when the imaging 
and targeting pipelines were significantly different from 
the 17.77-limit. 2 This produces a sample of 207654 ob- 
jects, of which, 94% have been observed spectroscopi- 
cally. The remaining 6% are primarily missed due to 
'fiber collisions', which means that the tiling pipeline is 
unable to assign fibers due to another target being less 
than 55" away. This is a limit imposed by the plate and 
fiber technology. When two or more MGS targets are 
within 55" of each other, a fiber is assigned at random to 
one of them. Of the spectroscopically observed targets, 
99.5% have reliable redshifts determined and, of these, 
97.7% are galaxies with redshifts between 0.001 and 0.3. 

We further restrict our sample to a low redshift range 
of 0.004 < z < 0.080 and a range in absolute magnitude 
of -23.5 < M r < -15.5, given by 

M r = r - fc r - 5 \og{D L / 10 pc) (4) 

where: r is the Milky- Way extinction-corrected, Pet- 
rosian magnitude; Dl is the luminosity distance for 
a cosmology with (fl m ,^A)o = (0.3,0.7) and H n = 
(hro) 70kms~ 1 Mpc -1 , and; k r is the k-correction using 
the method of Bla nton et al.l l)200 3b). 3 This produces a 
sample of 66846 galaxies with reliable redshift measure- 
ments. 

Including higher redshift galaxies can leverage better 
statistics on the bright galaxies but here we are also in- 
terested in the continuity between low and high lumi- 
nosity galaxies. In addition, restricting the sample to 
z < 0.08, red uces evolution effects and uncertainties in 
fc-corrections. iBlanton et al.l l|2003c[) reduced these types 
of uncertainties by fc-correcting to the z = 0.1 band- 
passes. This is optimal for the median redshift galaxies 
in the MGS but is sub-optimal for low luminosity galax- 
ies (only observed near z = 0). Therefore, we keep to 
the standard definition of fc-corrections (to z = 0). This 
also means that no extrapolation is required to get from 
the observed-frame bandpasses to the rest-frame u and r 
bands principally used in our analysis. 

3. THE BIVARIATE DISTRIBUTION 

For a spectral- type indicator, we use the rest-frame 
u — r color defined by 

Cur = ("^modcl — fc«) — (r mo( icl fcr) ■ (5) 

This is used because, even without fc-corrections, the u—r 
color has been show n to be a nearly opti mal separator 
into two color types l|Strateva et alJl200l|l . The u-band 
filter observes flux from below the 4000A break and thus 
any u — X color is highly sensitive to SFH (X = g,r,i 
or z). We determined that using u—r gave the most 
robust results for the analysis presented in this paper 
(though u — g gave a marginally better division by type 
for the more luminous galaxies). 

Model magnitudes are used because they give a higher 
S/N measurement than the Petrosian magnitudes, par- 
ticularly because the u-band flux is generally weak and 

2 Magnitude measurements in this paper were predominantly 
derived from photo v. 5.2. 

3 The fc-corrections were derived from kcorrect v. 1.16. 

4 We use the magnitudes as defined by the SDSS software 
pipelines. To co nvert to AB magnitud es: (u — r)(AB) (u — 
r)(SDSS) - 0.05 ( Ab azaiian et al. 2003]), and to convert to Vega 
magnitudes: (u - r)(Vega) w (it - r)(SDSS) - 0.85. 
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Fig. 1. — Observed bivariate distribution of the sample in rest- 
frame color versus absolute magnitude. The contours are deter- 
mined for galaxy number counts in 0.1 color X 0.5 magnitude bins 
(with a total of 66846 galaxies). The contour levels are on a log- 
arithmic scale, starting at 10 and doubling every two contours. 
The dashed lines represent the limits used in the double-Gaussian 
fitting described in Section HI 



aperture photometry includes significant Poisson and 
background-subtraction uncertainties. In fact, if Pet- 
rosian colors are used, using the it- band may not be op- 
timum. For example, IBlanton et al.1 (2003c) found that 
the bimodality was most evident in the 01 (5 — r) color. 
Note that SDSS model magnitudes are determined using 
the best-fit profile obtained from the r-band image and 
fitting only the amplitude in the other bands. 

The bivariate distribution of the sample in C ur versus 
M r is shown in Figure^ The bimodality is clearly visible 
with two tilted ridges representing the early- and late- 
type galaxies. The other u — X CM distributions appear 
similar (after scaling the color-axis appropriately). For 
the g — X CM distributions, the bimodality is still evi- 
dent (at low luminosities) but the late-type ridge appears 
to merge with the early- type ridge around M r ~ — 20 
whereas this occurs at slightly higher luminosities with 
the u — X colors. This probably reflects the changing 
dependence of dust and SFH on the colors of the late- 
type galaxies. For the remaining CM distributions (r — i, 
1 — z, i — z), the bimodality is not evident as the ridges 
have merged. 

3.1. Correcting for Incompleteness 

Before the distribution is analysed, there are two sig- 
nificant incompleteness issues to deal with: 5 (i) galax- 
ies of a given absolute magnitude and spectral type can 

5 We assume that the surface-brightness limit and star-galaxy 
separation criteria d o not si gnificantly affect the analysis presented 
here. IBlanton et alJ i2IK).'i( : show that the luminosity density due 
to galaxies as a function of surface brightness drops rapidly before 
the limit and Strauss et al. (2002) determined that only 0.3% of 
galaxies brighter than an r magnitude of 17.77 are rejected by the 
star-galaxy separation criteria. In addition, the low redshift sample 
(z < 0.08) analysed here will be less affected by these selection 
biases than the majority of galaxies in an r < 17.77 sample (median 
2 = 0.10). Instead, the brightest galaxies in our sample may suffer 
from deblending problems. Large galaxies are more likely to be 
blended with foreground stars and they are well resolved, which 
means that photo is more likely to measure their fluxes incorrectly 
(by stripping genuine parts from a galaxy). Their colors should be 
less affected since deblending is applied equally in all bands. We 
also assume that this deblending issue does not significantly affect 
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-21 -20 -19 -18 
M r ,Petro - 5 log(hvo) 

Fig. 2. — Bivariatc distribution of the sample in rest-frame color 
versus absolute magnitude, V surV ey/^max corrected. The contours 
are determined for galaxy number counts in 0.1 color X 0.5 magni- 
tude bins. The contour levels are on a logarithmic scale, starting at 
15 and trebling every two contours. The upper and lower dashed 
lines represent a fit to the mean positions of the Gaussian color 
functions for the red and blue distributions, respectively (the fit- 
ting is described in Scc.0] sec also Fig. ISt. 



only be observed within a certain redshift range, which 
in some cases is much less than the redshift range of the 
sample, and; (ii) some galaxies are not observed due to 
fiber collisions. 

To correct for the first issue, we weight each galaxy by a 
Vsurvey /V max factor before recomputing the bivariate dis- 
tribution, where V max is the maximum volume over which 
the galaxy could be observed within the sample redshift 
range (0.004 < z < 0.08, V; urV e y = 9.3 x 10 6 Mpc 3 ). 
We calculate Vmax by iterating to a solution for the k- 



correction at z m ; n and z max - 



The factor, V survcy /V^ax- 



varies from about 1.4 for the brightest galaxies (set by 
r > 13.5), down to 1.01 at M r w -21, up to 450/650 
for the faintest galaxies (set by r < 17.5/17.77). In 
the 17.77-limit region, the sample is virtually volume 
limited between absolute magnitudes of —23 and —20 
(Kmrvey/Knax^S 1.2). Note also that this correction fac- 
tor is principally a function of M r with little dependence 
on color at these low redshifts (r-selected sample), which 
means that this correction is important for the determi- 
nation of the luminosity functions but not for the CM 
relations. 

The class of galaxies that are not observed due to fiber 
collisions is not identical to the whole sample. On aver- 
age, these galaxies will be found in higher density regions. 
A very similar class of galaxies are those that are the 
nearest observed neighbors to the unobserved galaxies. 
These galaxies were, predominantly by chance, allocated 
a fiber instead of their neighbors. To correct for this is- 
sue, we weight these observed galaxies by 2.15. This fac- 
tor is determined from the number of unobserved galaxies 
divided by the number of unique nearest observed neigh- 
bors, plus unity. 

The corrected distribution of galaxies is shown in Fig- 
ure [5] The results of our fitting to the mean color val- 
ues along the red and blue distributions are also shown 
(described later). In the next section, we describe our 

ou r results. Som e discussion of bright-end incompleteness is given 
bv lStrauss et af] 



parametric fitting to the bimodal bivariate distribution. 

4. METHODOLOGY 

First of all, we summarize our assumptions and aims 
before describing our parameterization and fitting proce- 
dure. Our basic assumptions are given below. 

1. There are two dominant sets of processes that lead 
to two distributions of galaxies. 

2. For each distribution, the average spectral proper- 
ties vary contiguously with visible luminosity. This 
is reasonable because luminosity is correlated with 
the mass of a galaxy and gravity determines the 
movement of gas and stars. 

3. At each luminosity, each distribution can be ap- 
proximated using a normal distribution in the dif- 
ference between the near-ultraviolet and visible 
magnitudes (a log-normal distribution in the ra- 
tio between the fluxes). This could result from 
stochastic variations in SFH, metallicity and dust 
content (and inclination in the case of disks). 

Note that for our discussion, we assume tha t the stellar 
IMF is universal l|Wvselll997t lKrouDall200l . 
Our aims are: 

1. to quantitatively determine the variation in the 
mean and dispersion of the spectral colors of each 
distribution, as a function of luminosity; 

2. to determine separate luminosity functions; 

3. to relate the above to physical explanations; 

4. to define a best-fit cut in color versus absolute mag- 
nitude space to divide galaxies by type. 

Our aims differ from other work on early- and late-type 
galaxies, in that, we do not use a cut in morphology 
or spectral type. Instead, the analysis is based on the 
assumption of normal Gaussian distributions. Neverthe- 
less, we can safely assume that the red and blue distri- 
butions, described in this paper, correspond in general 
to the traditional morphological definitions of early and 
late types be cause of the well-known color-morphology 
relations (e.g. I fio berts Havneslll994t iShimasaku et alJ 
12003 iBlanton et al.ll2003c|) . 

4.1. Parameterization 

We assume that the bivariate distribution is the sum 
of two distinguishable distributions: 

$comb = *r + *b (6) 

such that $ dM dC is the number of galaxies between 
M r and M r + dM and between C ur and C ur + dC. The 
parameterization for these, red and blue, distributions is 
given by 

$(M r , C ur ) = <t>{M r ) g[C ur , (j,(M r ), a(M r )} (7) 

where <f> is the luminosity function and Q is the color 
function parameterized using a Gaussian normal distri- 
bution: 



g(C ur ,fj,,a) 



1 



: exp 



'{C ur — 



2a 2 



(8) 
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Both fi and a are constrained to be contiguous functions 
of M r , in particular, a straight line plus a tanh function 
given by 



T(M r ) = po + pi (M r + 20) + q tanh 



M r 



(9) 



This function was found to provide good fits to the data, 
in particular, significantly better fits than polynomials 
with the same num ber of parameter s. The luminosity 
functions are fit with lSchechterl l)197fit ) functions that can 
be written in terms of magnitudes as 

0(M r ) = C( £* e -c(«+l)(M r -M*) e -e-^-«*> (10) 

where: c = 0.4 In 10 (=0.921034); M* and <j>* are the 
characteristic magnitude and number density, and; a is 
the faint-end slope. 

4.2. Fitting 

For the purposes of fitting to the distribution, the sam- 
ple was divided into 16 absolute magnitude bins of width 
0.5 from —23.5 to —15.5. Each of these subsamples was 
divided into 28 color bins of width 0.1 in C ur - The range 
in C ur varied from 0.7-3.5 for the most luminous galax- 
ies bin to 0.0-2.8 for the faintest bin, to approximately 
track the CM relations. 

The procedure for fitting to the distribution is given 
below. 

1. For each absolute magnitude bin, an initial esti- 
mate, by eye, was made for the mean and disper- 
sion of each distribution. 

2. For each absolute magnitude bin, the dis- 
tribution over the color bins was fitted by 
a double- Gaussian function with parameters 
(j) r , /x r , cr r , (j>b, Mb, 0b (Figs. OHH The fitting used 
a weighted least-squares routine with a grid search 
in the \x and a parameters (narrowing from 0.016 to 
0.001). The variance in each separate bin was taken 
as Poisson (the sum of the galaxy weights squared) 
plus a softening parameter for small number statis- 
tics (two times the average weight squared at that 
absolute magnitude) 6 plus a softening for high- 
count bins (five per cent of the mean counts per 
color bin, squared, at that absolute magnitude). 
With these additions to the uncertainties, the re- 
duced x 2 values were on average unity. For the first 
two absolute magnitude bins (M r < —22.5), /Zb and 
<7b were not fitted and were fixed at extrapolated 
values, and for the last two bins (M r > —16.5), n T 
and <7 r were not fitted. This is because the S/N in 
these bins was insufficient for a useful 6-parameter 
fit. 

3. T functions were fitted to a as a function of M r for 
each distribution (Fig. EJ. 

6 The equivalent variance for small number statistics with no 
weighting would be N + 2 where N is the number of measured 
counts. This provides an approximation to uncertainties involved 
with low counts in a Poisson distribution. This expression can be 
derived by assuming a uniform prior in Ntrue (the average counts 
expected, which can be fractional), determining the probabilities of 
measuring N counts for each value of Ntrue, and finally calculating 
the probability-weighted mean-square deviation of Ntrue from TV. 
Using this variance estimate, standard least-squares fitting routines 
can be used with robustness to non-Gaussian outliers. 



4. Each of the absolute magnitude bins was fitted with 
double-Gaussian functions (as per stepEJ except all 
the a values were fixed by the T function fits. 

5. T functions were fitted to fj, as a function of M r for 
each distribution (Fig. EJ. 

6. The procedure was repeated up until this point 
(steps 121 EJ until there was no significant change in 
the T functions. This is necessary because of the 
extrapolation described in step El In other words, 
the fitting to the first and last sets of bins depends 
on the extrapolated values. The result converges 
quickly in one or two repeats. 

7. Each of the absolute magnitude bins was fitted with 
double-Gaussian functions (as per step 0) except 
the \i and a values were fixed by the T function 
fits. In other words, only the amplitudes of the 
Gaussian functions were fitted. 

8. Schechter functions were fitted to the final lumi- 
nosity functions (Fig. 01 . 

To summarize, double-Gaussian functions are fitted to 
the color functions of the galaxy distribution divided 
into absolute magnitude bins. The dispersions of the 
Gaussians are constrained to vary smoothly before refit- 
ting the double-Gaussians, and then the means are con- 
strained. Alternatively, constraining the means prior to 
the dispersions gives a slightly higher total \ 2 with sim- 
ilar overall results. The final set of double-Gaussian fits 
only allow the amplitudes to vary in order to obtain the 
luminosity functions with high S/N. These are fitted with 
Schechter functions. 

5. RESULTS AND DISCUSSION 

FiguresElandlHshow the results of the double- Gaussian 
fitting to the color functions. Visual inspection shows 
that the bimodality in the galaxy population is clearly 
traceable from about an absolute magnitude of —22 to 
— 17, and that a double-Gaussian function provides a 
good representation for the most part. For the high S/N 
mid-range in M r , there are some significant deviations 
but with the additional 5% systematic uncertainty de- 
scribed in Section l4~2*l the reduced \ 2 values are of order 
unity. We will assume that these slight non-Gaussian 
deviations do not affect our results. 

For two of the bins brighter than —22, there are signif- 
icantly more galaxies on the blue side of the red distribu- 
tion, justifying the continued use of the bimodal descrip- 
tion. For the most luminous bin, there is no evidence of 
any blue distribution and we only have an upper limit 
on the density of blue-distribution galaxies here. For the 
three bins fainter than —17, there are more galaxies on 
the red side of the blue distribution than the blue side. 
Note that for the two brightest and two faintest bins, the 
mean and dispersion of the less-populous distribution are 
fixed by extrapolation from the whole population. The 
general trend is for the number-density ratio of the red 
to the blue distribution to increase with luminosity. In 
the following subsections, we describe: in 15.11 the CM 
relations for the two distributions; in 15. 21 the luminosity 
functions; in 15.31 an optimum divider between the two 
types, and; in !5.4l a conversion to stellar mass. 
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Fig. 3. — Color functions for the galaxy distributions in absolute magnitude bins of width 0.5. Each plot shows galaxy number counts 
versus rest-frame u — r color. The crosses with error bars represent the V S urvey/V r ma x corrected counts in 0.1 color bins. The solid lines 
represent double-Gaussian fits while the dashed lines represent the single Gaussians of the blue and red distributions. 



5.1. Color- Magnitude Relations 

To quantify these distributions further, we assume that 
the Gaussian parameters vary smoothly from one abso- 
lute magnitude bin to the next. The dispersion and mean 
of each distribution are fitted by straight line plus tanh 
functions (T functions, five paras., Eqn. EJ. These fits 
are shown in Figures Eland El These T functions provide 
a far superior fit than a five-parameter (4th order) poly- 
nomial. 7 In addition, they are more stable for extrapola- 
tion (a straight line at the outside limits) and they can be 

7 The difference between x 2 (polynomial) and \ 2 (T) ls (1-4, 7.3, 
11.3, 41.5) for (<r r , Ob, fi T , /ib)i respectively. 



related more readily to a physical explanation (a general 
trend with luminosity plus a transition around a partic- 
ular luminosity). Table ^ shows the fitted parameters 
with uncertainties. The parameters pg and p\ represent 
the intercept and slope of the straight line while q , qi 
and q2 represent the amplitude, midpoint and range of 
the transition. 

5.1.1. The red distribution 

One of the most well-studied relations is the 
CM relation for luminous early-type galaxies 
llVisvanathan k, Sandagdll977t iSandage fc Visvanathanl 
Il978albt IBower. Lucev. fc Ellis Il992albl: 
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Fig. 4. — Color functions for the galaxy distributions continued. See Figure|3]for details. 



Table 1 

Color- and dispersion-magnitude relations: T function parameters" 



distribution 




PO 






Pi 












91 






12 






Mr 


2.279 


± 


0.006 


-0.037 


± 


0.006 


-0.108 


± 


0.017 


-19.81 


± 


0.07 


0.96 


± 


0.16 


1.8xl0 10 


G t 


0.152 


± 


0.006 


0.008 


± 


0.006 


0.044 


± 


0.018 


-19.91 


± 


0.18 


0.94 


± 


0.40 


2.0xl0 10 


l-'h 


1.790 


± 


0.014 


-0.053 


± 


0.008 


-0.363 


± 


0.029 


-20.75 


± 


0.05 


1.12 


± 


0.10 


2.6X10 10 


o"b 


0.298 


± 


0.004 


0.014 


± 


0.007 


-0.067 


± 


0.014 


-19.90 


± 


0.07 


0.58 


± 


0.19 


0.9xl0 10 



a The results of fitting a straight line plus a tanh function (Eqn. to the variations, in the means (fi) and dispersions (ex), 
of the red and blue distributions as a function of M r (Eqns. Qand|8J. The p parameters represent the straight line while the 
q parameters represent the tanh function. The fitted lines are shown in Figures and |S| Note that the errors quoted do not 
include systematic uncertainties due to photometric calibration or fc-corrections. 

b The transition midpoint approximately converted to stellar mass (see Section f5. 41 . 
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: (p;q) = (0.298, 0.01-1; -0.067, -19.90, 0.58) 



0.4 r 




: (p;q) = (0.152, 0.008; 0.044, -19.91, 0.94) 



-23 -22 -21 -20 -19 -18 -17 -16 
M r - 5 Iog(h 7 o) 

Fig. 5. — Dispersion-magnitude relations: variation in the dis- 
persion of the rest-frame u — r colors for each galaxy distribution 
from the double-Gaussian fitting (crosses for a r (M r ) and squares 
for o\,(M r ), with vertical error bars and horizontal bars represent- 
ing the width of the magnitude bins). Note that the error bars 
can be smaller than the symbols. The solid lines represent straight 
line plus tanh function (Eqn. |5J fits to the data. The dotted lines 
represent an extrapolation where the parameters are fixed in the 
double-Gaussian fitting. The parameters for the fitted functions 
are shown in the plot. Note that the measured dispersion includes 
observational uncertainties. 
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Fig. 6. — Color-magnitude relations: variation in the mean of 
the rest-frame u — r colors for each galaxy distribution (crosses for 
/i T (M r ) and squares for /n,(Af r ), with vertical error bars). Note 
that the error bars can be smaller than the symbols. The dashed 
line repre sents an optimal divider between the two distributions 
(Sec.|F*3J. The dash- a nd- dotted line shows t he slope of the average 
U - V CM relation of IBower etaD 11992b!) for E+S0 galaxies in 
clusters (—0.087), offset from the red distribution for clarity. See 
Figure|S]for other details. 



iSchweizer & Seitzerl 119921 


Terlevich. Caldwell. & Bowed 


200ll iBernardi et all 2003 


). This corresponds approxi- 



mately to the red distribution with M r < — 20 (Fig. EJ) - 
Our formal slope is about —0.04 (pi for fi T ) but we 
find that the slope gets steeper toward the transition 
midpoint at —19.8 (qi). 



Previous work found slop es of around —0.1 for 
the u — V CM relation ijVisvanathan fe Sandaeet 
IBower et aU iTerlevich et alJ) . The difference in slope 
between u — r and u — V relations is negligible 
ijVisvanathan fc Sandaeej) and therefore there is a non- 
trivial difference between our measured slope and previ- 



ous work. The difference can largely be explained by the 
use of a T-function fit rather than a straight-line fit be- 
cause a straight-line fit to our data gives a slope of about 
-0.08. 

Other factors that could contribute to a difference 
in slope are: field versus cluster environment; Gaus- 
sian color function fitting versus E +SO morphologi- 
cal selection and; aperture effects ijScodeggiol 1200 It 
IBernardi et al J 120031) . However, the CM relation has 
been found to be similar between different environments 
ijSandage fc Visvanathant ITerlevich et alJ) and no signif- 
icant difference has been found between E and SO galax- 
ies in the CM relation ES andage fc Visvanathanf ) and, 
therefore, all morphological types that are genuinely part 
of the red distribution may have a similar relation. An 
analysis of the difference between using SDSS model and 
ot her magnitude d efinitions for the CM relation is given 
by IBernardi et alJ We note that SDSS model colors are 
weighted toward the center of a galaxy and therefore 
the relations p resented here apply to t hat weighting (see 
Sec. 4.4.5.5 of iStouehton et all 1.2002. for model fitting 
details). 

For the color dispersion-magnitude relation (Fig. EJ, 
we find only a modest slope at the bright end with low 
statistical significance (pi for tr r is about 1 standard de- 
viation from zero). This is consistent with the CM rela- 
tion for M r < —21 being due to a metallicity- lumi nosity 
correlation l)Faberlll9"73nKodama fc Arimotoll997j) since 
dust reddening and SFH correlations could also introduce 
more scatter. 

The dispersion-magnitude relation for the red distri- 
bution goes through a transition at the same magnitude, 
within the uncertainties, as the CM relation (Figs. El 
and El Table HJ . This is consistent with the transition 
being due to an increasing contribution from recent star 
formation with decreasing g alaxy luminosity (from M r of 
about -21 to -19; see also iFerreras fc Silkl l2000). The 
colors of younger stellar populations are more dependent 
on their ages than older populations (see e.g. fig. 1 of 
IBower et al.lll9 98). which implies more dispersion in a 
CM relation. In other words, if there has been on av- 
erage more recent star formation in a class of galaxies, 
then their mean color becomes bluer and the color disper- 
sion increases for any reasonable variation in their precise 
SFHs. However, we cannot rule out the transition also 
being caused by a metallicity-luminosity correlation as 
long as the metallicity dispersion increases w ith decreas- 
ing galaxy luminosity ijPoggianti et alJl^DOlj) . 

Note that our measurements of dispersion include ob- 
servational uncertainties. At the bright-end, the mea- 
sured dispersion is about 0.09, which is comparable to 
the observational uncertainties and is, therefore, con- 
sistent with an intrinsic dispersion of less than 0.05 
^Visvanathan fc Sa,nHageiri977t IBower et a.UI1 992hft. 

5.1.2. The blue distribution 

Color-magnitude relations for lat e-type galaxies are 
also an established phenomenon (|Vis^ina^han|_^8Jl 
Tullyet a,UlT982l IT^M IWv^ IT982VlPeletier fc de Criisl 

1998). Here, we precisely trace a CM relation over seven 
magnitudes and find that it is very well fit by a tanh 
function plus a straight line (Fig.EJ). 

For the low-luminosity blue-distribution galaxies 
( M r > —19), we find a shallow CM relation slope (-0.05) 
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that is consistent with a metallicity-luminosity correla- 
tion for the following reasons. Studies of late-type galax- 
ies yield a strong metallicity-luminosity relati on down 
to low luminosities from their emission lines l)Garnetti 
2002; Trcmo nti et all20031) and from their stellar content 
(|Bell fc de ,Tongll2000D . In addition, the general slopes 
of the CM relations for the red and blue distributions, 
defined by the p\ values (i.e. excluding the transition), 
are approximately the same (within < 2 standard de- 
viations). Modest correlations of luminosity with SFH 
and/or dust are also possible. 

Over the luminosity range from —19.5 to —22 (increas- 
ing galaxy luminosity) , we find a significant reddening of 
the blue sequence that is too steep to be explained en- 
tirely by a metallicity-luminosity correlation. This tran- 
sition can be ex plained by a combination of an increase 
in dust content ijGiovanelli et aljri995UTullv et alJll998j) 
and a decrease in recent star f ormation relative to the to - 
tal stellar mass of the galaxy (P eletier fc de Griisl ll998). 
These processes will have opposite effects on the disper- 
sion. Increased dust content will increase dispersion, be- 
cause of the range of reddening associated with different 
disk orientations, whereas decreased star formation will 
decrease dispersion because old stellar populations vary 
less in color (c.f. the luminous red distribution). Our in- 
terpretation of the dispersion-magnitude relation (Fig.EJ) 
is then that the dust content increase dominates the tran- 
sition from —19.5 to —20.8 (ub increases, increases) 
and that the competing processes approximately cancel 
from —20.8 to —22 (at, decreases slightly, /ib increases). 
This explains why the tanh fit for a\, does not coincide 
with that for /Zb (Table 0. We take the genuine transi- 
tion in the properties of the blue distribution to be that 
defined by the \x\, fit. 

5.2. Luminosity Functions 

The results of fitting the amplitudes of the double- 
Gaussian functions are used to determine the luminosity 
functions (Eqn. [7J while the mean and dispersion of the 
CM relations are constrained to be T functions (Eqn.[5J). 
The luminosity functions are shown in Figure and Ta- 
ble |21 To fit to these luminosity functions, we increase 
the errors slightly in order to avoid being overly con- 
strained by the high S /N bins, which could be dominated 
by systematic errors and in consideration of large-scale 
structure uncertainties. The results of fitting Schcchtcr 
functions are shown in Figure and Table Overall, 
about 42% of the r-band luminosity density is in red- 
distribution galaxies. Not surprisingly, this is slightly 
larger than the 38 % found to be in red-type galaxies by 
iHogg et alJ l)2002l) because their definition of red-type 
was based on strict cuts in color, concentration and sur- 
face brightness. 

Note that a single Schechter function was found to give 
a good fit to the red distribution but not to the blue 
distribution. In the latter case, there is a small but sta- 
tistically significant slope change around M r ss —20. To 
account for this, we used a double-Schechter function but 
with the same value for M* (i.e. the sum of two power 
laws with one exponential cutoff, it was not necessary 
to allow two-different M* values to provide a good fit). 
The double-Schechter function provided a noticeably bet- 
ter fit to the faint end of the luminosity function with a 
steeper faint-end slope (a' = —1.35, the second power 
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Fig. 7. — Luminosity functions for each galaxy distribution 
(crosses for <f> I (M r ) and squares for b (M r ), with error bars). The 
lines represent fits to the data. The dashed line for b and the 
solid line for r represent standard Schechter functions, while the 
solid line for 4>h represents a double Schechter function with a sin- 
gle value for M* . The standard single Schechter function does not 
provide a good fit to the blue distribution. The parameters for the 
single Schechter r and double Schechter 0j> fits are shown in the 
plot. 



Table 2 
Luminosity functions" 



M r — 5 log hro 


4>i h 70 


) 




4>b h 7 Q 






(mag 1 Mpc 3 ) 




(mag -1 Mpc -3 ) 




-23.25 


(2.06 ± 0.35) 


x 10" 


5 


(0.30 ±2.83) 


x 10" 


(i 


-22.75 


(0.99 ± 0.07) 


x 10" 


-4 


(2.45 ± 0.44) 


x 10" 


5 


-22.25 


(3.30 ± 0.15) 


x 10" 


-4 


(2.04 ±0.12) 


x 10" 


4 


-21.75 


(5.81 ± 0.24) 


x 10- 


-4 


(6.24 ±0.26) 


x 10" 


4 


-21.25 


(8.82 ± 0.33) 


x 10" 


4 


(1.18 ±0.04) 


x 10" 


3 


-20.75 


(1.14 ±0.04) 


x 10" 


-3 


(1.68 ±0.06) 


x 10" 


3 


-20.25 


(1.30 ± 0.05) 


x 10" 


-3 


(2.09 ± 0.07) 


x 10" 


3 


-19.75 


(1.27 ±0.05) 


x 10" 


-3 


(2.31 ±0.08) 


x 10" 


3 


-19.25 


(1.28 ±0.06) 


x 10" 


-3 


(2.85 ±0.10) 


x 10" 


3 


-18.75 


(1.18 ± 0.06) 


x 10" 


-3 


(3.48 ±0.13) 


x 10" 


3 


-18.25 


(1.12 ± 0.08) 


x 10" 


-3 


(4.74 ± 0.20) 


x 10" 


3 


-17.75 


(1.23 ± 0.12) 


x 10" 


-3 


(5.56 ±0.30) 


x 10" 


3 


-17.25 


(1.17 ±0.16) 


x 10" 


-3 


(6.27 ± 0.49) 


x 10" 


3 


-16.75 


(0.95 ± 0.22) 


x 10" 


-3 


(5.43 ± 0.70) 


x 10" 


3 


-16.25 


(1.01 ± 0.40) 


x 10" 


-3 


(0.99 ± 0.24) 


x 10" 


2 


-15.75 


(1.77 ±0.92) 


x 10" 


-3 


(1.27 ±0.54) 


x 10" 


2 



° Non-parametric luminosity functions for the red and blue 
distributions (see Eqn. The errors include formal and 
systematic uncertainties. The latter include a constant 3% 
plus a fraction proportional to 1/V max increasing to 40% for 
the lowest luminosity bin (to approximately account for large- 
scale structure effects). The functions are shown in Figure[7| 



law dominates here) compared to the single Schechter fit 
(a = —1.18). Note that this is purely a mechanism for 
obtaining a better fit to the luminosity function and it 
should not be interpreted as evidence for two blue pop- 
ulations. 

The red distribution has a significantly shallower faint- 
end slope (a — —0.83) than the blue distribution. Re- 
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Table 3 

schechter function fits to the luminosity functions 



distribution M* — 5 log h^Q 


<t>* h 7 Q 
(10 -3 Mpc" 3 ) 


a 


</>*' h£ 
(10~ 3 Mpc" 3 ) 


a' i + 2.51og/i 70 6 
(Mpc" 3 ) 


cp r -21.49 ±0.03 


2.25 ±0.08 


-0.83 ±0.02 




-14.79 (42%) 


<p h -20.60 ±0.08 


2.82 ±0.32 


+0.26 ±0.21 


2.35 ±0.37 


-1.35 ±0.05 -15.13 (58%) 


<f> h -21.28 ±0.03 


2.89 ±0.13 


-1.18 ±0.02 




-15.08 



a A single Schechter function was found to give a good fit to the red distribution (<^ r ) but not to the blue distribution (<pb)- In 
the latter case, a significantly better fit was obtained by summing two Schechter functions (with a single value for M*). Both 
the double- and single-Schechter function parameters are shown for 4>^. 

h The luminosity density in absolute magnitudes per Mpc 3 . The percentage in brackets is the fraction relative to the total 
r-band luminosity density. 



lated results have been found by divid ing; galaxies into 
classe s from early to late spectral types: IMadewick et alJ 
( 2002) found faint-end slopes from —0.5 to —1.5 based on 
emis sion and absorption l ine strengths in optical spectra, 
and: IBlanton et alT(|2001f) found a steepening of the slope 
from red to blue galaxies based on cuts in g—r color (their 
fig. 14). However, the equivalent steepening of the faint- 
end slope toward late types base d on morphological clas - 
sification appears less significant ijNakamura et all 2003). 
This is not inconsistent with our result since the red 
and blue distributions at the faint and bright ends need 
not have the same mix of morphological types. In other 
words, the processes that result in the red (or the blue) 
distribution also produce a range of morphological types 
that need not be the same at low and high luminosities. 

5.3. Dividing the Distribution 

One of the ways to divide a galaxy sample is by ab- 
solute magnitud e to compare, for e xample, galaxy clus- 
tering relations ijZehavi et al.ll20(5^l . Figure |S] shows the 
ratio between the luminosity functions as a function of 
absolute magnitude. The ratio of the red distribution to 
the total galaxy population gradually increases from low 
luminosities to M r « —22. For galaxies more luminous 
than this, the fraction of the population derived from the 
red distribution increases more rapidly. This agrees with 
the standard result that the mo st luminous galaxies are 
almost entirely early types (e.g. IBlanton et alll2003cj ). 

We can also look at divisions in color and absolute mag- 
nitude space. Figure [5] shows the bivariate distribution 
separated into the two types based on the analysis in this 
paper. In general, it is possible to define regions of the 
space that are almost entirely derived from one distribu- 
tion except, notably, for a region around M r ~ —21.5 and 
C ur ~ 2.5. Here, dusty and/or bulge-dominated spirals 
'overlap' in this space with old stella r-population ellipti- 
cals. A related result was obtained bv lHogg et al.l {2003), 
studying "the overdensities of galaxy environments as a 
function of luminosity and color" , where luminous red 
and faint red galaxies were found, on average, in more 
overdense regions than M* red galaxies (see their fig. 2). 
Our analysis can explain this result because of the blue 
distribution interlopers, formed by a different set of pro- 
cesses, with similar colors to the M* red distribution 
galaxies. 

Modeling a distribution with two sequences in this way 
naturally leads to a physical model with two kinds of 
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Fig. 8. — Fraction of galaxies that are part of the red distribution 
as a function of absolute magnitude: <j> T (M r ) / [<j> T (M r ) + </>b(M r )]. 
The diamonds with error bars represent the ratios determined from 
the non-parametric luminosity functions (Table|5J while the dashed 
line represents the ratios determined from the Schechter function 
fits (Table 0. 



galaxies with different processes associated with them. 
Therefore, it would be appropriate to study the prop- 
erties of each distribution separately. However, this is 
not precisely possible because of the dispersions and 
overlaps associated with each distribution. Instead, we 
can make an optimal divider by defining figure of mer- 
its based on the doub le-Gaussian description. Following 
IStrateva et al.l (|200 1T> . for any cut on color, we can es- 
timate the 'completeness' (C) and 'reliability' (1Z) of a 
sample. For example, if we use C ur > C' ur to select 
the red distribution then: C r is the fraction of galaxies 
from the red distribution that are selected, and; 1Z T is 
the fraction of galaxies selected that derive from the red 
distribution (i.e. 1 — 1Z r is the contamination from the 
blue distribution). 

There are many ways to define an optimum divider in 
color as a function of absolute magnitude based on dif- 
ferent weightings of completeness and reliability. They 
can be determined using the parameterized description of 
the data described in this paper. Here, we define an op- 
timum divider that best selects red distribution galaxies 
redder than the color cut and vice versa simultaneously, 
with a figure of merit defined by C r 7?. r Cb7?.b- This opti- 
mal divider (parameterized by a tanh function) is given 
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-23 -22 -21 -20 -19 -18 -17 -16 
M r - 5 log(h 70 ) 

Fig. 9. — Empirical red and blue bivariate dist ributions (<J> r and 
<&b) using the parameterization of Section 14,11 with values from 
Tables HI and 151 The solid contours represent the red distribution, 
while the dashed contours represent the blue distribution. The con- 
tour levels are on a logarithmic scale, starting at an arbitrary level 
and doubling every two contours. The thick dash- and- dotted line 
represents an optimal divider (Sec. 15^1 , Note that the measured 
dispersion includes observational uncertainties, which implies that 
the luminous red 'ridge' is, in reality, significantly narrower. 



by 



C' ur (M r ) = 2.06 - 0.244 tanh 



M r + 20.07 
L09 



(11) 



and is shown in Figures El and [§J The optimal color 
division varies from about 2.3 at the bright end to 1.8 
at the faint end of the galaxy distribution. For galaxies 
fainter than M r of -21, we obtain C r > 0.8, TZ r > 0.8, 
Cb > 0.85 and IZh > 0.95 at all magnitudes, but for more 
luminous galaxies, both Cb and 1Z V drop below 0.8 due 
to the increased overlap of the blue distribution with the 
red distribution (1Z T rises again for the most luminous 
two or three bins because of the thinning out of the blue 
distribution). 

5.4. Conversion to Stellar Mass 

In terms of relating variations in galaxy properties 
to models of galaxy formation and evolution, it is 
more appropriate to consider stellar mass than lumi- 
nosity because stellar mass is more closely related to 
baryon content. Stellar mass-to-light ratios (Ai/L) 
can vary by up to a factor of about ten for the r- 
band luminosity. However, Ai/L can be estimated by 
fitting population -synthesis models to colors or spec- 
troscopic indices llBrinchmann fc Ellis! 120001: iBell et aTl 
l2003al iKauffmann et al.ll2003al) . In order to convert our 
results to stellar-mass relations, we use an approximate 
color- Ai / L conversion given by 

\og{M/L r ) = a + bC ur (12) 

where (a, b) = (—0.55, 0.45) and; Ai and L r are the mass 
and specific luminosity in solar units. 8 This is a useful 

8 We use M for mass and M for absolute magnitudes. The con- 
version is given by \ or{MI Mg\) = {M r p , — M r )/2.5 + log(M/L r ) 
where M rQ = 4.62 IBlanton etalll200% . 



approximation because there is a significant correlation 
between u — r and Ai/L. 

The coefficients in Equation 1121 were derived from 
an averag e of analyses based on the stellar masses of 
■Bell et (12003^ and IKauffmann et~aTI ll2003l). for 
which we obtained (a, b) m (—0.3, 0.35) and (—0.8, 0.55), 
respectively, by fitting log (stellar mass) as a function of 
(u — r) mo doi for low rcdshift galaxies (z < 0.08). The 
assumed stellar IMFs were similar between the two anal- 
yses, 9 and therefore the differences aris e principally from 
the methodologies (see IBell et aTl and IKauffmann et all 
for details). This gives us some estimate of the system- 
atic uncertainties involved with this type of modeling. 
For the stellar-mass ranges quoted in this section (be- 
low), we use the a, b coefficients given above and include 
uncertainties from our fitting. Note that we do not in- 
clude uncertainties in the stellar IMF (or, e.g. evolution- 
ary tracks) , which could amount to ^30% uncertainty in 
the absolute values of the stellar masses, and the conver- 
sion to total mass is considerably more uncertain due to 
the dominance of dark matter in most galaxies. 

For simplicity, we apply the Ai / L adjustment 
(Ean.lT2*| to the relations and luminosity functions using 
the /z r and /ib values as a function of absolute magnitude 
(Table This is a reasonable adjustment for the av- 
erage galaxies in each distribution. Figure ITUl shows the 
luminosity functions adjusted for stellar mass-to- light ra- 
tios, in effect, galaxy stellar mass functions (GSMFs). 
The parameters for the Schechter fits are shown in the 
plot. The red distribution is shifted to higher masses 
with respect to the blue distribution. The stellar mass 
density per magnitude is dominated by the red distri- 
bution for galaxy stellar masses greater than about (2- 
5)xlO lo A4 . Overall, about 54%-60% of the stellar 
mass density is in red-distribution galaxies (depending 
on the coefficients of the approximate conversion to stel- 
lar mass). 

In Figure EH we al so plot the c olor-se lected early- and 
late-type GSMFs of IBell et all l|2003a|) . Notably, the 
early types have a significantly higher number density per 
magnitude relative to the late types around Ai* whereas 
the GSMFs from the double-Gaussian fitting have simi- 
lar number densities here. This reflects the fact that our 
analysis quantifies an overlap in color space (Fig.EJ), and 
thus enhances the 'late type' number density compared 
to a standard colo r selection, even if using a slope in g — r 
versus M r (as per B ell et all ). 

The transitions in galaxy properties occur around (1.5- 
2.2)xlO lo A^0 for the red distribution (qi for fi T and 
tr r ) and around (2-3) x 10 10 A4q for the blue distribu- 
tion (qi for /ib), based on converting the CM rela- 
tions (see Table Despite our simplistic treatment 
of Ai/L conversions (which do not differentiate be- 
tween dust attenuation and SFH effects), our transition 
masses are close to the transition in galaxy properties 
noted bvlKauffman n et alJ l)2003b|) that occurred around 
3 x 10 10 Air:). Here, we have resolved this transition into 



9 IBell et all used a 'diet' lSalpeterl Jl95^ IMF, which gives about 
70% of the M/L compared to a 'standard' Salpeter IMF, and 



IKauffm ann et al. used a iKroupa ( 2001) IMF (eqn. 2 of that pa- 
per). These IMFs wer e found to b e consisten t with cosmic SFH and 
luminosity densities tBaldrv & Glazcbrook 2003}), i.e. with aver- 
age g alaxy colors, and with galaxy rotation curves IBell de Jond 




Fig. 10. — Galaxy stellar mass functions, for each distribu- 
tion, derived from luminosity functions adjusted to account for 
variat ions in the stellar mass-to-light ratio as a function of color 
(Eqn. 1121 . Note that, as well as the x-axis being adjusted, the 
number density is adjusted to account for the stretching of the 
magnitude bins and the conversion to base-ten logarithms. The 
solid lines represent the Schechter functions fits to our data while 
dotted lines represent th e color-selected early- and late-type GSMF 
fits of lBell et ahl I2003al) . See Figure[7|for other details. 



three different effects, a change in dominance from one 
distribution to the other, a change in the properties of 
the red distribution and a change in the blue distribution. 

6. CONCLUSIONS 

We have devised a new method of analysing color- 
magnitude relations based on considering double- 
Gaussian distributions in color (Figs. and |3J rather 
than strict cuts on morphological or other properties. 
From this, we obtain CM and dispersion-magnitude rela- 
tions for two dominant red and blue distributions, which 
can in general be associated with classical definitions of 
early- and late-type galaxies. These relations are evident 
across seven magnitudes (Figs. [5] and 0) but are not well 
fit by a straight line. Instead, we find that a straight line 
plus a tanh function provides good fits (Eqn. El Tablc^). 

For both the red and blue distributions, we can asso- 
ciate the general trend (the straight line part of the com- 
bined function) with a universal metallicity-luminosity 
correlation. The tanh function can be associated with 
a transition in other properties of the galaxy popula- 
tion, which could include star-formation history and dust 
attenuation (in the case of late types). Note we have 
not proved the above physical explanations but have ob- 
tained them f rom previous results and analyses in the 
literature fe.g. |Zarf(;sky et al.||1994| iKycjama fc Arimot^ol 
| 1997t|Peletier fc de Griisll998HTullv et alll99SHGarnettl 
hooalKauffmann et al.l2003b|) . Further work is required, 
e.g. population synthesis fitting to SDSS spectra, to bol- 
ster and quantify the physical explanations for these re- 
lations. 

After converting to stellar mass, we find that the mid- 
points of the transitions parameterized by the tanh func- 
tions are around 2 x 10 10 Mq (Table QJ. In addition, 
we find that the number density per magnitude of the 



Fig. 11. — Simulated galaxy stellar mass functions, for each dis- 
tribution, from a simple merger model (solid line and thick dashed 
line for the red and blue distributions, respectively). See the Ap- 
pendix for details of the model. The merger scenario produces a 
shallower faint-end slope and a more massive characteristic mass 
for the red distribution compared to the blue distribution. The dot- 
ted lines represent the Schechter function fits to our data (Fig.rtn 
while the thin dashed line represents the initial function in the 
model. The parameters for the model are shown in the plot and 
relate to: initial faint-end slope (a); mass exponent for weight- 
ing in the merger model (/3); probability of a faint galaxy merging 
with a more massive galaxy (7), and; fractional increase in mass 
from merging used to determine which galaxies are part of the red 
distribution (<5). 



red distribution overtakes the blue distribution at about 
3 x 10 10 Mq (Fig.^J. These changes in properties of the 
galaxy populat ion are in good agreemen t with the tran- 
sition found bv lKauffmann et all (|2003bj) at 3 x 10 10 Mq 
using spectroscopic measurements. 

In order to study the physical properties of each dis- 
tribution separately, it is necessary to divide them. To 
do this, we defined an optimum divider based on min- 
imizing the overlap between the two Gaussian descrip- 
tions (Eqn. ^2 Fig.|51). We note that this works well for 
galaxies fainter than M r ~ — 21. For galaxies more lumi- 
nous than this, morphological indicators that also show 
a bimodality can work better at dividing the population 
into two types. Thus, a weighted combination of various 
measurements (from photometry, spectroscopy and mor- 
phology) could provide a better division by type, with 
the weights varying with absolute magnitude. 

The luminosity functions of the two distributions are 
significantly different from each other (Figs.[7|and|Sl Ta- 
bles |21 and 13) . The red distribution luminosity function 
has a shallower faint-end slope and a more luminous char- 
acteristic magnitude. The difference between the two 
distributions can be explained in terms of a merger sce- 
nario where the red distribution derives from more major 
mergers. To show this, we first approximately converted 
the luminosity functions to galaxy stellar mass functions 
(Fig. HUB and fitted a simple numerical model to the data 
(Fig. Illfl . Some discussion of mergers and a description 
of the model is given in the Appendix. This is consistent 
with hierarchical clustering theories. 

Finally, we note that further work could proceed in a 
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number of directions including: (i) denning an optimum 
division between the two distributions by combining vari- 
ous observed quantities; (ii) analysing the spectra of each 
distribution; (iii) studying the distributions of the mor- 
phological properties; (iv) comparing the CM relations 
between different galaxy environments, and; (v) simu- 
lating galaxy mergers and hierarchical clustering to test 
the cause of the bimodality. Here, we propose that the 
double-Gaussian fitting technique represents a model- 
independent way of defining a 'post-major-merger' se- 
quence, in that the uncertainties due to blue-distribution 
interlopers are quantified, without using a semi-arbitrary 
cut in morphology or spectral type. 
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APPENDIX 
A SIMPLE MERGER MODEL 

Early-type galaxies tend to have more spherical geometries, more virialized motions of stars, less dust as well as 
redder colors. N-body simulations suggest that the geometries and the motions of stars, similar to those observed in 
ellipticals, can be produced by galaxy mergers (|Barnesl H988: Barnes fc Hernauistlll992|). In addition, if the m erger 
causes the gas and dust to be expelled and/or used up in a burst of star formation t| Joseph &; Wright! 1 1985() then 
the galaxy's star-formation rate will be lower (at some later time) than star-forming late-type galaxies. This in turn 
will mean redder colors for galaxies produced by mergers as long as any induced star burst does not dominate the 
stellar population or the merger occurred at h i gh redshift, i.e. as long as mo st of the stars formed at high redshift 
ijBaugh. Cole, fc FrenldlT996t lKauffmannlll996|) . iKauffmann &: Chariot! l)1998f) have shown that a hierarchical merger 
model can reproduce the CM relation of cluster ellipticals. 

Given these lines of argument, it is reasonable to suppose that mergers are the cause of the bimodality, with the red 
distribution deriving from major merger processes and the blue distribution deriving from more quiescent accretion 
(with only minor mergers at most). To test this, we devised an illustrative non-dynamical merger model to see if the 
basic shapes of the GSMFs (Fig. mfy with respect to each other could be explained. The procedure for this model is 
described below. 



1. A population of galaxies is created with an initial baryonic-mass function described by a Schechter function 
with a faint-end slope a (for simplicity, we assume that all the baryons will be used to form stars and thus 
can be related to the GSMFs observed today). The population is defined from about 1CU 3 M* to 10 M* . The 
characteristic mass and number density, M.* and </>*, are adjusted to best match the data after the simulation. 

2. These galaxies are numbered from 1 to N ga \ s in order of increasing mass. 

3. For each galaxy i, it is determined whether it will merge with a more massive galaxy based on a probability 
equal to 

where M.j is the initial mass of the j-th galaxy. In other words, the probability is the sum of the more massive 
galaxies weighted by mass with an exponent (3, divided by the total mass in the population, multiplied by 7. 
The probability of the lowest mass galaxy merging with another galaxy is approximately 7. 

4. For each merged galaxy i, the mass is added to another galaxy at random but with a weighting proportional to 
A4j for j > i (and for j < i). 

5. For each remaining galaxy, the fractional increase of its mass relative to its initial mass is determined. Galaxies 
wi th fractional increases greater than 6 are determined to be in the red distribution (similar to the / C ni p parameter 
of Kauff mamTWhite. fc Guiderdonilll993T) . 
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6. The model GSMFs for the red and blue distributions are determined and M* and <$>* are adjusted to best fit the 
data, over the ranges 8.6 < \ogM* < 11.8 for the red and 8.4 < log.M* < 11.6 for the blue distribution. 

The additional physical assumptions behind this scenario are: that galaxies form from quiescent accretion with a 
distribution in masses defined by a Schechter function, and; the probability of merging with a more massive galaxy 
is related to the number density and masses of all these galaxies. The model is simple in the sense that it is non- 
dynamical, the timing of accretion and merging is not accounted for, and the parameter [3 hides the complex physics 
associated with forces on dark-matter haloes and their baryon contents. Note also that we do not model the CM 
relations, only the GSMFs. 

Figure ^] shows a best-fit example of the simulated GSMFs from this simple merger model. It reproduces the 
shape of the red-distribution GSMF with high accuracy and the approximate faint-end slope of the blue distribution 
(though the shape is slightly different). Thus, the different luminosity functions (or GSMFs) can be explained if the 
red distribution is derived from galaxies where more than a certain fraction of their mass has come from mergers rather 
than 'normal' quiescent accretion. In other words, the red distribution is a post-major-merger sequence where 'major' 
is determined by the ratios of the masses of the merging galaxies. This sequence could also include galaxies derived 
from the sum of many minor mergers, which could evolve a galaxy from a spiral to an SO l|Walker. Mihos. &: Hernquist! 

Dm}). 
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